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ABSTRACT 

Context. Recent observations of HD 49933 by the space-photometric mission CoRoT provide photometric evidence of solar type 
oscillations in a star other than our Sun. The first published reduction, analysis, and interpretation of the CoRoT data yielded a 
spectrum of p-modes with Z = 0, 1, and 2. 

Aims. We present our own analysis of the CoRoT data in an attempt to compare the detected pulsation modes with eigenfrequencies 
of models that are consistent with the observed luminosity and surface temperature. 

Methods. We used the Gruberbauer et al. frequency set derived based on a more conservative Bayesian analysis with ignorance priors 
and fit models from a dense grid of model spectra. We also introduce a Bayesian approach to searching and quantifying the best model 
fits to the observed oscillation spectra. 

Results. We identify 26 frequencies as radial and dipolar modes. Our best fitting model has solar composition and coincides within 
the error box with the spectroscopically determined position of HD 49933 in the H-R diagram. We also show that lower-than-solar 
Z models have a lower probability of matching the observations than the solar metallicity models. To quantify the effect of the 
deficiencies in modeling the stellar surface layers in our analysis, we compare adiabatic and nonadiabatic model fits and find that the 
latter reproduces the observed frequencies better 

Key words, stars: late-type - stars: oscillations - stars: fundamental parameters - stars: individual: HD49933 - techniques: photometric 



1. Introduction 

■ CoRoT (Convection, Rotation, and planetary Transits) is a space 
. mission focusing on aste roseismology and the detectio n of ex- 
' oplanetary transits. See 'Boisnard & Auvergne' ('2006') for an 
overview of the technical details and Baglin et al. ( 2006) for a re- 
view of the asteroseismology aspects of the mission. HD 49933 
(F5 V, my = 5.77) is a main sequence star with an effective tem- 
perature of about 6500 K, log g ~ 4.0, L/Lo 3 .6, and [Fe/H] = - 
0.38. The determination of these values is described in our 
Sect.|2] HD 49933 is similar to Procyon, except with a lower 
metal abundance. HD 49933 was observed during the initial run 
of CoRoT from February 6 to April 7, 2007, only little more than 
one month after launch of CoRoT on December 27, 2006. 

The reduction of the CoRoT photometry to the N2 data for- 
mat is described in Appourchaux et al. (2008). They also pro- 
vide the first frequency determination, a preliminary echelle di- 
agram, and a mode identification for HD 49933. Because their 
conclusions differ from ours we review their procedure here 
highlighting areas where we have adopted a different set of 
premises. They use a maximum likelihood estimator technique 
to fit Lorentzian mode profiles to the observed power spectrum 
with symmetric rotational splitting components included with 
the non-radial modes. In order to reduce the number of free pa- 
rameters, they assume the same line width for modes of the same 
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radial order, and they assume the mode heights are related to 
each other by fixed ratios. For the rotational splitting compo- 
nents, they assume the splitting is symmetric in frequency with 
the split-mode height ratios determined from the inclination an- 
gle parameter. 

Our approach to identifying frequencies jGruberbauer et al] 

I2OO9I hereafter Paper I) is distinct, in that we emphasize the 
reliability of the frequency detections over the quantity of de- 
tected frequencies. Without prejudging the identity of a mode, 
we first simply ask if a mode is statistically visible in the data. 
Then when we have ascertained its existence above some statis- 
tical measure, we try to fit stellar models to the resultant list of 
frequencies. When comparing observed frequencies with stellar 
model frequencies, even a few incorrectly identified frequencies 
can significantly complicate the analysis or even lead to a misin- 
terpretation. 

In Paper I we describe the fitting of solar-type p-modes as 
a parameter estimation problem without the need to give pref- 
erence to any specific model. A Markov-Chain, Monte Carlo 
technique for a Bayesian analysis of Lorentzian profiles was 
used rather than fitting complex models to the observations. 
Specifically, we did not make any assumptions about mode 
heights, knowing that noise in the data, itself, can affect the 
observed heights of individual modes. Rather, we used our fits 
to the mode heights compared to the noise level to character- 
ize the viability of the mode being detected. We, therefore, only 
assumed the Lorentzian profiles have the same line width. This 
assumption might distort the correct determination of the mode 
height and line width parameters, because it is known, e.g., for 
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the Sun (e.g. IChaplin et al.l l2005h . that the mode hfetime is a 
function of frequency. To fix the mode line widths to a single 
parameter, however, should not (or only marginally) influence 
the determination of the mode frequencies, which is the param- 
eter we are most interested in. We extracted a number of radial 
and I - 1 modes. We found no evidence in the CoRoT data of 
statistically signifiant / = 2 modes or for rotationally split com- 
ponents. If they exist, of course, these components are below the 
thr eshold of our mode-height -detection criterion. We argue that 
the I Appourchaux et al.l (12008 ^ approach is too restrictive in that 
it presumes specific characteristics of the modes identity. At this 
early stage in the analysis of such stars, we are not comfortable 
making any assumptions about the modes even at the expense of 
identifying fewer modes. 

In Sect.|2]we re-examine the fundamental parameter deter- 
minations of HD 49933. In Sect.|3]we describe two methods for 
searching large grids of stellar models for a best fit between 
observed and model frequencies. One of the methods, again a 
Bayesian approach, allows us to estimate the statistical likeli- 
hood of the fit compared to other models in our grids. Our best 
fitting model has solar composition and is located in the H-R 
diagram within the spectroscopically determined error box of 
HD 49933. 



2. Fundamental parameters 

Eight different values for the effective temperature, 
based on spectroscopy and photo metry and rangi n g from 
6467 K to 6780 K are cited by iGillon & MagaSl (l2006h . 
iBlackwell & Lynas-Gravl 0998) determined the effective tem- 
perature to be 6512 K by using the IR flux method. In their 
paper on the CoRoT aster oseismic observations of HD 49933, 
lAppourchaux et al.l (l2008h adopt an effec tive temperature o f 
6780+130K, which originally comes from'Br untt et al.1 (l2008l) . 
Another frequently cited value for the effective temperature is 
6576±9 8K, which is derived frorn Stroemgren photometry us- 
ing the Moon & Dworetskvl ( Il985h calibration. This procedure 
also provides a surface gravity of log g = 4.30+0.09. Many of the 
published temperature values are based on standard photometric 
calibrations or automatic spectroscopic determinations. As a 
consequence of the large scatter of the published values for this 
essential stellar parameter we performed our own temperature 
determination using high-quality spectroscopic observations. 

In February 2006, 10 high-resolution spectra (R~ 115 000) 
were obtained for HD 49933 with the HARPS spectrograph at 
the ESO-LaSilla 3.6m telescope. We retrieved these spectra 
from the ESO archive and increased the signal-to-noise ratio 
(~500 in a 0.5 A continuum bin around 5000 A) by co-adding. 
The continuum normalization was done with a low order poly- 
nomial fit to regions free of visible spectral lines. 

To derive a consistent and reliable value for Teff we com- 
pared the observed hydrogen line profiles with synthetic spec- 
tra computed with Synth3 ( KochukhoY 2007) . In the given tem- 
perature range the hydrogen lines are very sensitive to tempera- 
ture variations, are less sensitive to logg variations, and depend 
on the metallicity of the at mosphere. We calcu lated model at- 
mospheres with LLmodels dShulvak et al.l 12004 ). an LTE code 
that uses direct sampling of the line opacities and allows mod- 
els to be computed with an individualized abundance pattern. 
Mgi line wings (AA 5167 5172, 5183 A; Fuhrmann et ai][T997h 
and the ionization equilibrium of several additional elements 
were used to determine the gravity. We started our model at- 
mosphere iteration with the most convincing value of Teff, based 
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Fig.l. Observed Hfi and Mgi Une profile of HD 49933 (gray 
line), compared to synthetic profiles. The thin black lines cor- 
respond to synthetic line profiles assuming Teg and log g to be 
6450 K and 3.9, respectively. Left panel: The dashed lines shows 
the synthetic profile by increasing (lower line) and decreasing 
(upper line) Teff by 50 K. For plotting reasons we excluded the 
metallic lines from the synthetic hydrogen line profiles. Right 
panel: The dashed black line shows the synthetic profile by in- 
creasing log § by 0. 1 dex. 



on models with solar abundances, and a surface gravity derived 
from photometry. We derived the abundances of several elements 
from equivalent widths and then determined log g from a set of 
models using the previously obtained Teff and abundances. With 
this value for the gravity we determined anew the abundances 
and used them to improve Teff. After two iterations we con- 
verged to Teff = 6500+50 K and log^ = 4.0 ±0.15, which is sig- 
nifican tly cooler than the value adopted bv lAppourchaux et al.l 
(I2008h . The small error in Teff can be explained by the high 
temperature sensitivity of the hydrogen line profiles (see left 
panel of Fig.[TJ. Regardless, for the following considerations 
we adopt a more conservative error for the effective temper- 
ature of +75 K. The atomic line parameters were taken from 
file VALD database jPiskunov et all 119951: iKupka et all 119991: 
iRvabchikova et al. I ll999h . except for the Van der Waals broad- 
ening constants, which were taken from Fuhrmann et al. ( 19971). 
The quoted error for log g is similar to other st ars with compa- 
rable effective temperatures, such as Procyon ( iFuhrmann et al.l 
IT997h . and using the same method. 

It is useful in asteroseismic modeling to know the metallic- 
ity of the star. According to our spectroscopic analysis the at- 
mospheric metallicity is Z = 0. 008±0.002 where iro n is under 
abundant compared to the Sun (i Grevesse et al.|[T996l) . in agree- 
ment with published values. Carbon and oxygen, on the other 
hand, have near solar abundances. This variation could be due 
to the effects of gravitational settling and radiative levitation in 
the thin convective envelope of the star. The true interior abun- 
dance, relevant for stellar models and pulsation analysis, is un- 
certain. Ideally, stellar models should include the effects of ele- 
ment diffusion. Further details of the spectros copic analysis with 
a discu ssion of the literature can be found in IRvabchikova et all 
(120091) . 

Using the Hipparcos parallax n - 33.69 + 0.42 mas 
dvan L eeuwen 2007) we abtain the absolute visual magni- 
tude My - 3.4 1 8 + .026. Interpolating the tables of 



iLeieune & SchaereJ (1200 lb we obtain a bolometric correction 
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BCv = -0.054 ± 0.005 (for [Fe/H] = -0.4) which is more than 
twice the value used bv Appourchau x et al.l ( l2008l) . With Mboj 
= 4.75 we obtain L/LO= 3.58+0.10 (Appourchaux et al. 2008 
use 3.39+0.08) and a corresponding radius of R = 1.46+0.05 Ro. 



3. Asteroseismic analysis 

For our analysis we use the frequencies given in Paper I (see 
Tab. 5 therein) where we exclude P25 and P26 following the 
suggestion of the authors. They did not assign credibility to 
these values arguing with the ambiguity in their marginal fre- 
quency distribution, which is apparent in the listed large un- 
certainties. In Paper I, no specific mode identifications to the 
frequencies were assumed and mo de heights were left uncon- 
strained. Ap pourchaux et all (l2008l) . on the other hand, assumed 
specific mode identifications (/ = 0, 1, and 2) and fixed relative 
mode heights. 

We search a dense and extensive grid of stellar model oscil- 
lation spectra looking for the model spectra that matches best 
the observed frequencies to within the known uncertainties of 
the observed frequencies. We use a simple formula to quan- 
tify how well the spectra match each other. This approach is fast 
and efficient and can handle sparse and contaminated oscilla- 
tion spectra. Although a lower x^ does indicate a higher likeli- 
hood, the computed;t'^s cannot be used to associate a confidence 
level to the inferred parameters. We, therefore, introduce a new 
method based on a Bayesian approach that uses the error dis- 
tributions of the observed frequencies to quantify in terms of a 
probability how well the frequencies of a given model match the 
observations. 

The grids of models were constructed using the Yale Stellar 
Evolution Code (YREC; 'Guent her et all 119921) . The grids in- 
clude models with masses ranging from 0.6 to 2.00 M0 in steps 
of 0.005 Mo and from 2.00 to 5.00 Mo in steps of 0.01 Mo. 
Each evolutionary track runs from the zero age main sequence 
to halfway up the giant branch. The solar grid is based on a near 
solar composition and calibrated mixing length parameter (Z = 
0.02, Y = 0.27, ci'=1.8). The low Z grid, which has the same mass 
resolution but is not nearly as extensive in its coverage, is based 
on the same mixing length parameter and (Z, Y) = (0.008, 0.24). 

The constitutive phy sics o f the models include OPAL98 
dlglesias & RogersI Il996l) . the [Alexander & FergusonI (11994 
opacity tables, and the Lawrence Livermore National Laboratory 
equation of state tables (Rogers 1986; Rogers et al. 199 g). The 
standard Bohm-Vitense (IVitensel 119531: IBohm-VitensJfl958l) 
mixing length theory is used to model convection. The mixing 
length parameter used to describe the temperature gradient in 
convective regions was adjusted from calibrated solar models 
and rounded to tw o significant di gits. We have assumed stan- 
dard solar mixture (iGrevesse et al.lll996l) . Although the effects 
of helium and heavy element difiiision are included in the model 
physics, diffusion is automatically turned oS shortly after turn- 
off to avoid known computational problems associated with very 
thin convection zones. Each model was resolved into approxi- 
mately 2000 shells, with two-thirds of the shells placed in the en- 
velope and atmosp here. Guenthers nonradial nonadiabatic stellar 
pulsation program (iGuentherl 19941) was used to compute the adi- 
abatic and nonadiabatic pulsation spectra of each of the models 
in the grid. The nonabatic code includes the effects of radiation 
on the modes but does not include the, probably equally impor- 
tant, effects of convection. 



3.1. x^ mode matching 

We begin our model analys is using the mode ma tching 
method first introduced by iGuenther & Brownl (l2004i) . The 
method is basic. Large grids of stellar models and their spectra 
are constructed. The stellar model spectra are compared one-by- 
one to the observed spectrum. How well the two spectra compare 
is quantified by the following 



X 



-y 



(Vm,i - Vojf 
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/=1 



where Voj and v^j are the observed and corresponding model 
eigenfrequency of the i-th mode, respectively. The observational 
uncertainty is given by cto, and No corresponds to the total num- 
ber of modes used for the fit. This approach provides a good 
estimate of how well a set of observed frequencies coincides 
with a model eigenspectrum. A value of x^ ^ ^ -0 means that, 
on average, the model agrees within the uncertainties of the 
observed frequencies. In other words, the x^ provides a mea- 
sure for the average normalized deviation between observed and 
model frequencies and should not be misi nterpreted in terms of 
a probability. For more d etails, see, e.g.. iGuenther et al. I (l2005l) 
or lKallingeretal.l(l2008h . 

To find a best model fit to the 26 observed frequencies (Tab. 5 
in Paper I) we searched for modes from / = to 3, inclusive. 
In our range of interest in the H-R diagram the "resolution" of 
our grid, i.e., the step in frequency for a mode of given degree 
and radial order from model to model, is about 4/iHz (~0.2% 
of the observed frequency with the highest amplitude). This is 
quite high compared to the uncertainties in the observations. We 
therefore interpolated our grid linearly along the evolutionary 
tracks, effectively increasing the grid density by a factor of 10. 

The best fitting model using nonadibatic frequencies, in 
other words, the model with the lowest x^ - 0.79, matches the 
observed frequencies with 13 radial and 13 dipole modes. The 
best fitting model using adiabatic frequencies has x^ - \ .0. The 
nonadiabatic frequencies provide a better fit than the adiabatic 
frequencies. The model itself, though, is only slightly different. 
We list the model fit properties in Tabl.[T] In the top panel of 
Fig.|2]we provide a direct comparison between the observed fre- 
quencies and the p-mode frequencies of the best fitting model. 
The echelle diagram shows that both the adiabatic and nonadi- 
batic model reproduce not only the general pattern of the ob- 
served frequencies, but also the detailed structure of the mode 
sequences. 

A theoretical H-R diagram is shown in Fig. |4] representing 
the subset of the stellar model grid used for the pulsation analy- 
sis, the error box for the effective temperature and luminosity of 
HD 49933, and the best nonadiabatic model fits to the observed 
frequencies with x^ less than 4.0. The adiabatic model fits yield 
a nearly identical plot. The best fitting nonadiabatic model, lo- 
cated within the error box for HD 49933, has an effective tem- 
perature of Teff = 6484 K, a luminosity of L = 3.52 Lo, a radius 
of R = 1 .49 Ro, a mass of 1 . 325 Mo, and an age of 2. 1 5 Gyr (see 
Tabl.[T]l. 

Even though our model searches included Z = 0, 1,2, and 
3 p modes, our best adiabatic and nonadiabatic model fits do 
not need to use any I - 2 and 3 modes and , as a consequence, 
our mode identifications are different from [Appourchaux et al.l 
(120081) . This is not an issue of insufficient frequency resolution. 
If they could be detected in the data, the / = 2 and 3 mode 
frequencies form a sequence in the echelle diagram separated 
from the radial mode sequence by a spacing that is several times 
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greater than the frequency uncertainties. Unhke the I — and 2 
modes, the / = and 1 are well separated and nearly independent 
of each other 

If we search our nonadiabati c grid for a best match betw een 
the model frequencies given by Appourc haux et all (1200 8*) and 
if we use only those identified by them to come from / = and 1 
modes we, find no model with ax^ better than 19.6. This model 
(1.325 Mo, 6487 K) is very close to our best model (see Tabl.[Tll 
an d indicates that - on averag e - the frequency subset derived 
by 'App ourchaux et al] ( 1200 8') is similar to our frequency set, 
which is not really surprising as we are using the same data. 
The high value, however, indicates that their frequency val- 
ues scatter quite a bit around those predicted by (our) models 
and the frequency uncertainties are underestimated (by up to a 
factor of 5) in contrast to our frequencies and uncertainties, de- 
rived with a Bayesian approach, and which result inax^ = 0.79 
(see Tabl. 1 and Fig.|3]l. But more importantly, the mode identi- 
fications in our analysis a re different from that is advocated by 
lAppourchaux et al ] (120081) . If we force our search program to use 
their frequencies and /-value identification, we obtain ax^ > 45 
and a model temperature difference of more than lOOOK. 

The mass and age of our best model fit depends on the as- 
sumed composition of the model grid. We choose primarily for 
reference purposes a solar composition grid with (Y, Z) = (0.27, 
0.02). As discussed in Sect.|2] our model atmospheres indicate 
that HD 49933 is metal poor compared to the Sun, but with solar 
C and O abundances! Therefore, a more realistic composition for 
HD 49933 would be closer to (Y, Z) = (0.24, 0.008) assuming 
normal rates of the Galactic enrichment of helium and metals. 
Because we do not have a full grid for this specific composition 
available, we computed a smaller grid of about 10000 models 
centered on the previously best fitting model. 

Repeating the analysis with the lower Z adabatic and nona- 
diabatic grids, we obtained the echelle diagram for the best fit- 
ting model, which is presented in the bottom panel of Fig.|2] 
Apparently, the low-Z fit is not as good as the solar-Z fit. One 
must be careful, though, not to conclude that HD 49933 has a 
solar Z, since we do not know the intrinsic model uncertainties 
exactly. For our Sun, for example, we know that the frequencies 
of our best solar models differ by up to 0.5% from the observed 
frequencies for higher radial orders, i.e., higher frequencies. This 
discrepancy is attributed to known deficiencies in the modeling 
of the surface layers. 

With frequency uncertainties in asteroseismic observations 
approaching ±0.5 yuHz, it is now essential to consider the effects 
of the surface layers on the models. For the Sun we know that our 
inadequate modeling of the surface layers, especially in the re- 
gion of the superadiabatic layer, results in model frequencies that 
depart from the observed solar frequencies by more t han lOyuHz 
at the highest frequencies (see iGuenther et anil996 , and refer- 
ences therein). The model deficiencies have been identified as 
including uncertain low temperature opacities, inadequate model 
atmosphere, incomplete inclusion of nonadiabatic effects due to 
radiation loss and convection, and inadequate mixing length ap- 
proximation used to model convective energy transport. 

We ourselves are working toward including proper stellar at- 
mosphere models in the model physics and improved structure 
descriptions in the outer convective envelopes based on 3D nu- 
merical simulations of convection dDemarque et al.|[l999l) . But 
this research is still far from completion. At this time, we can 
either desensitize our model fitting to the effects of the surface 
layers or try to estimate their size and note their effect on our 
model fit conclusions. Because we want to directly address the 



question of how important and what the influence is of surface 
effects on our model fits we have chosen the latter 

Regardless, there are several methods in use today to remove 
the sensitivity of model-fitting to th e surface layers t hat are worth 
reviewmg m our current context. 'Ki eldsen et all (|2008) use a 
power-law fit to correct for the effect of the surface layers on the 
frequencies. The power-law exponent is determined from best 
model fits to the sun and then the scaling factors are determined 
from the model fits, themselves, to the observations. The method 
relies on correctly identifying the observed modes (their / and n 
values) and assumes that the surface layers scale from the Sun. 
Although this approach is useful for stars that are known to be 
like the Sun, we believe it makes too many assumptions about 
the nature of the surface layers of stars and may cause to mis- 
leading results for other stars. 



Roxburgh jRoxburgh & Vorontsovl l2000i 1200 iL 120031: 
Roxburgh 20()5j) recommends either using ratios of small 
to large spacings, which are insensitive to surface layers, or 
computing internal phase shift corrections for the models. The 
former is easier to implement, but the latter will ultimately yield 
more information about the location and size of the surface layer 
effects. The latter method, though, still has to be fully integrated 
into a x^ minimization approach that does not assume / and 
n identifications. Both approaches do yield superior interior 
model fits to the observations. They do not teach us what is 
missing from our models. 

To obtain an estimate of the size of the effect of the missing 
physics in our surface layer modeling, we compared model fits 
using adiabatic frequencies to fits using nona diabatic frequen- 
cies. The nonadiabatic frequency computation ( iGuentheiil 19941) 
accounts for energy gains and losses due to radiation, primar- 
ily occurring in the superadiabatic layer of the star. We did not 
compute the gains and losses caused by convection nor do we 
include the effects of turbulent pressure on the modes. For the 
Sun, nonadiabatic radiation effects account for almost half of 
the discrepancy between observed and modeled frequencies. 

The model-fitting results as represented in the Echelle di- 
agrams of Fig.|2] show that the nonadiabatic mode frequencies 
are close to the adiabatic frequencies, only showing a difference 
greater than the observational uncertainties above 2200 yuHz. In 
terms of^^ fits, we do not see a significant difference. The nona- 
diabatic frequencies are preferred with a slightly lower x^ (see 
Tab.[Tli. The best-fit models are also nearly identical. For the cur- 
rent set of observed frequencies below 2200 yuHz, surface effects, 
i.e., inadequacies in the modeling of the surface layers, are com- 
parable to the observational uncertainties. Figure|3] shows this 
most dramatically. We plot the frequency difference between the 
adiabatic and nonadiabatic model frequencies versus frequency. 
We also plot the frequency difference between the observed fre- 
quencies and the nonadiabatic frequencies with error bars show- 
ing the observed frequency uncertainty. The scatter of the ob- 
served minus nonadiabatic frequencies and the error bars are 
larger than the frequency difference between the adiabatic and 
nonadiabatic frequencies up to 2200 //Hz. This suggests that the 
surface effects are less than the error bars. 

The best-fit, nonadiabatic metal-poor model has x^ - 2.26, 
a mass - 1.205 Mo, an age - 2.98 Gyr, a radius = 1.43 Ro, an 
effective temperature = 6644 K , and a luminosity = 3.57 Lo. 
The low Z model is located outside the error box of HD 49933 
in the HRD, which would indicate the need for considerably im- 
proved stellar models. The adiabatic model fit is not as good with 
a;(/2 ^4.16 (see Tabl.[B. 
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3.2. Bayesian mode matching 

The -method used so far gives a reasonable result, and one 
can intuitively "judge" (also based on the quite different val- 
ues) which of the two fits is better. As noted above, this does 
not imply that the derived model parameters (mass, age, etc.) 
are correct since they depend on the assumed model physics. A 
disadvantage of the ;t'^-method is the lack of comparability of 
different model fits in a statistically satisfying way. Because we 
have normalized the by assuming the frequencies are inde- 
pendent, which they are not, we can only compare fits when 
the number of frequencies is the same. In other words, we can 
identify the model that fits a given set of observations best, but 
we cannot quantify in terms of a probability how much better 
that model fit is to another set of frequencies. Similarly, fits from 
different grids cannot be compared directly, because we have no 
a priori way of normalizing thex^- In the present case, the fit to 
the solar-calibrated grid appears better than the fit to the metal- 
poor grid, but we cannot quantify how much better it is. Finally, 
the mode dependencies on the structure are not completely in- 
dependent. For example, pairs of modes defining the small spac- 
ing have similar eigenfunction shapes in the surface layers, only 
departing in the deep interior As a consequence, the x^ results 
normalized by 1/N actually depend slightly on which N modes 
are matched. In the following we outline our approach to inter- 
pret the available information in terms of a probability, based on 
the Bayesian theorem. 

In general, Bayesian techniques have proven to be very 
successful in comparing obs er vations with models such as 
|j0r^ensen & LindegrenI (I2005h or lBazot et al] 12008) , which de- 
termine some stellar parameters using isochrones and observed 
values of effective temperature and luminosity. Here, though, we 
utilize the seismic data to constrain the models. 

From the Gaussian distribution of the observational uncer- 
tainty it follows that the likelihood that a model frequency v,„ 
matches an observed frequency v,, with an uncertainty cto is. 



P(Vm) = 



exp 



2crl 



(2) 



In fact, we subsequently ignore the normalization factor, 
(cTg V27r)"', because it cancels out in the subsequent analysis. 

As with the x^ method, we search our grid of models for 
model(s) whose frequencies fit the observed frequencies best, 
but in this case we are looking for the highest probability rather 
than the lowest In the Bayesian approach, though, we can 
assign an overall probability of the model M matching the ob- 
served frequencies with respect to the entire set of models ac- 
cording to Bayes' theorem as 



p(M\D,I) 



p(M\I)-p(D\M,I) 
P(DJ) 



where 



No 



p{M\I) - and 

Nioi 



P(D\M,I) = Y\P^^ 



(3) 



(4) 



1=1 



are the uniform prior probability for a specific model and the 
likelihood function, respectively. For the former we use the per- 
centage of the total number of observed frequencies (Niot) that 
are used for the fit (No). As aresult, the more frequencies are 
used the higher is the resulting probability. 

The likelihood function is the "and" probability of the 
individual probabilities of the most credible model fre- 
quencies, which were identified in the search for the best 



model fit. In other words, p(D;M,I) is the probability that 
P(ym.\) A Pivma) A - A Piym.N„) fit the corresponding observed 
frequencies. 

The overall model probability is a very sensitive tool because 
a high probability is only assigned if (almost) all observed fre- 
quencies can be matched well. It has to be mentioned that in 
practice we allow a model frequency to be assigned to more than 
one observed frequency. This is especially important for exam- 
ple with rotational split modes where several observed modes 
need to be matched to a single-mode frequency, since the mod- 
els do not include rotationally split modes. The denominator of 
Eq.[3]is a normalization factor for the specific model probability 
in the form of 



p(D,I)^Y,p(Mi\I)-p(D\Mi,I). 



(5) 



Since the uniform priors are the same for all models they cancel 
in Eq.[3] which simplifies to 



P^p(Mi\DJ) 



p(D\MiJ) 
Z%piD\Mj,I)' 



(6) 



The resulting model probabiUty distribution in the model space 
shows regions of highest probability in contrast to others. This 
automatically translates into uncertainties in the fundamental pa- 
rameters of the stellar models by constructing the marginal dis- 
tribution of the corresponding model parameter The normalized 
probability of the most probable model is therefore a measure 
of how likely this model is with respect to the other models of 
the specific grid. We stress that it does not tell us how probable 
the model fit is in an absolute sense. The probability is restricted 
to the space of the models being considered and their associated 
physics. 

When comparing solutions of different fits (e.g., for differ- 
ent model grids, or using different sets of observed frequencies) 
a more robust value for the "quality of the fit" is the geomet- 
ric mean of the frequency probabilities (Eq.|2]i of the best fitting 
model: 



(7) 



The best-fit model identified by the Bayesian approach is the 
same model as that identified by the x^ algorithm with the 
added benefit that the Bayesian formalism provides ranges in 
which model parameters can be constrained. A full mathemati- 
cal description and extensive tests of this probabilistic approach 
will appear separately (Gruberbauer et al., in preperation). As 
a demonstration we use the Bayesian technique to compare the 
probabilities of the best fitting models to the solar and low Z 
(nonadiabatic) grids. The best-fit model to the solar grid has an 
overall probability of 0.33 with a quality of the fit of ~0.09. 

Using the -method to find a best fit between model and ob- 
served frequencies we found a ridge of models with;^'^ below a 
given threshold, which are basically located on a contour with a 
constant large frequency separation where the ridge spans quite 
a wide range in temperature and luminosity (of course depend- 
ing on the chosen threshold). The model parameters cannot be 
constrained in an objective manner 

As an example, in Fig.|5] we show the cumulative (i.e., in- 
tegrated or summed) probability distribution functions of the 
models as a function of effective temperature and luminosity, 
respectively. From them, we can derive the most probable model 
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Table 1. Summary of the best fi tting s olar-abundance and metal-p oor adiabatic and nonadiabatic models for the frequencies given 
by A: iGruberbauer et all (l2009l) and B: lAppourchaux et all ( l2008l) . 



Source 


(Y,Z) 


adia./nonadia. 


M 


R 


Teff 


L 


logg 


age 




Q 








[Mo] 


[RO] 


[K] 


[LO] 


[cm-s -J 


[Gyr] 






A 


(0.27, 0.02) 


nonadiabatic 


1.325 


1.49 


6484 


3.52 


4.22 


2.15 


0.79 


0.090 


A 




adiabatic 


1.325 


1.49 


6485 


3.52 


4.21 


2.13 


1.0 


0.081 


A 


(0.24, 0.008) 


nonadiabatic 


1.205 


1.43 


6644 


3.57 


4.22 


2.98 


2.26 


0.043 


A 




adiabatic 


1.205 


1.43 


6641 


3.57 


4.21 


2.99 


4.16 


0.017 


B 


(0.27, 0.02) 


nonadiabatic 


1.325 


1.49 


6487 


3.52 


4.22 


2.14 


19.6 


1.810-5 


B* 




nonadiabatic 


1.600 


1.53 


7447 


6.42 


4.28 


0.18 


44.9 


8.3-10-" 


B 


(0.24, 0.008) 


nonadiabatic 


1.205 


1.43 


6645 


3.57 


4.22 


2.97 


27.5 


3.7-10-'' 










1.46±0.05 


6500±75 


3.58±0.1 


4.0±0.15 









parameters and their uncertainties as, e.g. the median and the 
+ lcr environment. The latter refer to a Gaussian error distribu- 
tion and can easily be replaced by, e.g. a confidence interval. The 
cumulative probability distribution functions, however, yields a 
most probable effective temperature and luminosity of 6484+ 1 K 
and 3.5205±0.001 Lo, respectively. If the frequency uncertain- 
ties were larger, than the high probability would be distributed 
amongst more models leading to a shallower cumulative prob- 
ability distribution, and the uncertainties of the model parame- 
ters would be larger. The uncertainties are not the uncertainties 
within which we can determine HD49933's position in the HD- 
diagram, but they instead correspond to the uncertainties within 
which we can constrain the model parameters within a given 
grid. The uncertainties appear unrealistically small because we 
have not included any assumptions about the model uncertainties 
themselves. 

By applying the Bayesian method to the metal-poor model 
grid, we find the best model fit (identical to the best model) 
has an overall probability of about 0.39 and a quality of the fit of 
~0.043. Interestingly, the overall probability of the best fitting 
model is higher for the metal-poor grid than for the solar one. 
One cannot conclude that the metal-poor grid fit is better because 
the best fitting probability is a measure of the probability of the 
best-fit model compared to all the other models in its grid and 
not to the other models' grid. To directly compare the fits of 
different grids it is necessary to compute the global likelihood, 
i.e. the sum of the individual model probabilities from each grid. 
This is done in the following way. 

The probability that grid A provides better model fits to a 
given set of frequencies than grid B (assuming that the models 
themselves are correct) is given by 

P(A) = ^ ^ : (8) 

^ i:,- p(D\Maj, /) + ^ piDlMBj, I) 

with Na and Nb the total number of models in grids A and B, re- 
spectively. Accordingly, we find the probability that the nonadia- 
batic frequencies of the solar-abundance grid fit the observed fre- 
quencies better than the adiabatic frequencies to be about 0.92, 
which is higher than to the probability of about 0.08 that the 
adiabatic frequencies provide a better fit than the nonadiabatic 
frequenices. It becomes even clearer if we compare the different 
metallicity fits. Whereas the probability that the low Z (nonadi- 
abatic) grid fits the observed frequencies better than the solar Z 
(nonadiabatic) grid is only about 5.7-10-'*, the probability that 
the solar Z grid fits better than the low Z grid is naturally almost 
1 . In other words, of the two grids, the solar Z grid of models is 



a significantly more likely fit to the observations than the low Z 
grid of models. The obvious limitation of this conclusion is that 
it is based entirely on the assumption that the models themselves 
are coiTect. This is why a broader range of model parameters and 
physics needs to be examined before one draw any reasonable 
conclusions. But our goal here, though, is primarily to demon- 
strate the potential of the Bayesian approach. 

Even though the new approach yields excellent results in 
the present case, it is very sensitive to the presence of artifacts, 
i.e., if some of the observed frequencies are not intrinsic to the 
star, then the Bayesian approach will (correctly) not identify any 
models as being probable. This is not an issue for the modes 
we identified for HD 49933 in Paper I, but can be a problem for 
other analyses with lower quality data sets. From Eq.|4]it can be 
seen that a single observed frequency that does not come close to 
a model frequency (and consequently has a very low probability 
of being matched by one of the model frequencies) pushes the 
overall model probability close to zero. This in turn justifies our 
very conservative frequency determination described in Paper I. 
In Gruberbauer et al. (in prep.) we will present a modified ver- 
sion of the algorithm that takes possible artifacts into account. 

4. Conclusions 

In Paper I we used a Bayesian approach to determine pulsa- 
tion frequencies of HD 49933 in photometric data obtained by 
CoRoT. In this paper we have introduced a probability, based on 
Bayesian methodology, to quantify how closely a model matches 
the observed pulsation frequency spectrum within uncertainties, 
compared to other models in a dense and extensive grid of model 
oscillation spectra. 

We obtained the following results from our analysis. 

1. We first compared the 26 frequencies to model oscillation 
spectra in our solar composition grid. We identified a best 
fit with X - 0.79 using nonadiabatic frequencies. The lu- 
minosity and effective temperature of the best model fit lies 
within the uncertainty box for HD 49933 in the H-R diagram. 
The mass, age, and radius of the best model fit is 1.325 Mo, 
2. 15 Gyr, and 1.49 RO. 

2. Owing to the uncertain interior metal abundance, we re- 
peated the search with a lower Y and Z grid of models, (Y, Z) 
= (0.24, 0.008). We identified a best fit with - 2.26. The 
luminosity and effective temperature of the best-fit model lie 
outside the uncertainty box in the HR diagram for HD 49933. 

3. We used a Bayesian approach to assign a probability to the 
model fits and confirmed that the solar Z fit is significantly 
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more probable than the low Z model fit, assuming that the 
models themselves are valid. 

4. We showed th at the mode identification of 
lAppourchaux et al] (|2008|) yields significantly higher 

values than our mode identification. 

5. We showed that the adiabatic and nonadiatic frequencies 
yield nearly identical model fits to the observed modes. For 
the Sun it is known that the nonadiabatic effect accounts for 
a large portion of the surface layer effect. We therefore con- 
clude that the deficiencies in modeling the stellar surface lay- 
ers only have weak effects on the present analysis. 

A summary of the best model fits to our frequencies (de- 
rived in Paper I) and to a subset of frequencies listed by 
lAppourchaux et al] (l2008l) is given in Tabl.[T| In the latter case 
we use only frequencies which the authors attribute to / = or 
1 modes but ignoring their mode identification. Only for case 
B* do we enforce their mode identification, is normalized. 
Lower values indicate a more probable fit. The parameter Q 
is a probability and refers to the quality of the fit (Eq.|7]). Non- 
seismically determined fundamental parameters are given in the 
last row. 

The frequency extraction by lAppourchaux et a D (I2008h 
yielded some frequencies that they interpreted as / = 2 modes 
or rotational split components. Our more conservative analysis 
does not yield any evidence for frequencies of higher order than I 
- 1. When we looked for statistically significant modes adjacent 
to the well-identified modes, we could not find any evidence for 
additional modes or rotational split co mponents. We suggest that 
possibly the a priori assumption by App ourchaux et all (l2008l) 
that the / = 0, 1 , and 2 modes have a fixed ratio of heights may 
have led to spurious I - 2 mode identifica tions. Likewise, we 
came to a different mode identification than lAppourchaux et alj 
(I2008h . 

Our estimates of the surface abundances of Fe, C, and O re- 
veal that Fe is under abundant compared to the Sun and that C 
and O have near solar abundances. Owing to this ambiguity we 
fit the observed frequencies to both a solar composition grid and 
a low Z grid. The convective envelope mass of the best-fit model 
in the solar composition grid is ~0.0008 Mo. 

We note that the abundances of both helium and heavy el- 
ements in HD49933's thin convective envelope will be affected 
by diffusion processes such as gravitational settling and radia- 
tive levitation. Because of computational difficulties we do not 
follow element diffusion when the envelope thins below 0.1% 
of the star's total mass. The mixed metal abundances at the sur- 
face make this star an excellent candidate for detailed modeling 
that includes Y and Z diffusion. A key question we hope to ad- 
dress in a future paper is, whether we can distinguish and isolate 
the effects of diffusion, metal abundance, helium abundance, and 
mixing length parameter. 

We introduced a Bayesian approach to define a probabil- 
ity that the observed frequencies match a given model spec- 
trum within a grid of model eigenspectra. The model prob- 
ability allows one to compare distinct model fits with each 
other, something that the normalized values cannot do. 
Although our analysis of HD 49933 has yielded results that ap- 
pear to be more consisten t with non-asteroseismic data than the 
lAppourchaux et al] (l2008l) results, we do not consider this to be 
the final word on the subject and fully expect that HD 49933 
will provide us with new information about Procy on-like stars. 
We look to future asteroseismic analyses of the CoRoT data to 
determine the depth of HD49933's convective envelope, to de- 
termine its helium abundance, to confirm its location in the H-R 



diagram and phase of evolution, and to test models of Y and Z 
diffusion. Where seismologists had difficulty trying to interpret 
the tentatively identified frequencies in Procyon, a star similar to 
HD 49933, CoRoT's observations of HD 49933 are of such high 
quality that we will soon begin testing the models themselves. 
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Fig. 2. Top panel: Echelle diagram of the observed oscillation 
frequencies and the adiabatic and nonadiabatic eigenfrequencies 
of the best fitting model from the solar abundance model grid. 
The observed frequencies are shown as filled circles with the 
error bars indicating the observational uncertainties. The nona- 
diabatic model frequencies used in the analysis are displayed 
with gray symbols connected by line segments. The adiabatic 
frequencies are indicated by line segments only. Bottom panel: 
Same as above, but for the best fitting metal-poor model. 
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Fig. 3. Top panel: Difference between the adiabatic and nona- 
diabatic eigenfrequencies for the / - and 1 modes of the best 
fitting solar-abundance model and the frequency difference be- 
tween the observed and nonadiabatic frequencies. Black dots in- 
dicate the differences between the observed and the nonadiabatic 
frequencies with error bars for the observations. Bottom panel: 
Similar to above except showing the lAppourchaux et al.l (l2008l) 
frequencies and uncertainties. 
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Fig. 4. Theoretical H-R diagram showing the uncertainty box of 
HD 49933 and a subset of the solar abundance stellar model grid 
(light gray dots). The color scale (in the online version only) 
gives the values for the fits of observed and model frequen- 
cies, where the scale is limited to values lower than 4.0. Colored 
circles and crosses correspond to model fits to the solar abun- 
dance and metal poor grid, respectively. The grid only is show 
for the solar abundance models. The large black crosses indicate 
the positions of the best-fit models. The inserts show the model 
probability (black filled circles) and the x value (gray symbols) 
distribution for the solar calibrated grid as a function of model 
age and mass, respectively. 
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Fig. 5. The cumulative probability distribution functions for the 
effective temperature and luminosity of the solar calibrated grid. 
The dotted lines correspond to the median and the dashed lines 
give the +lcr confidence interval. 



